The invention relates to a method, a device and a transmitter, in which an image
signal is encoded to obtain a bit-stream. The invention further relates to transcoding. The 
invention also relates to a receiver, a bit-stream and a storage medium. 

EP-0 514 663 A2 discloses an apparatus and method for motion video encoding
employing an adaptive quantizer. The MPEG video standard defines a layered architecture for
compressing a video sequence. First, a sequence of video pictures is subdivided into disjoint
Groups Of Pictures (GOPs). Each GOP is compressed independently of other GOPs to
facilitate random access to any picture and also to limit the propagation of transmission errors.
Every picture in a GOP is subdivided into Macro-Blocks (MBs). For a color picture, an MB is
a collection of 16x16 luminance pixels and two 8x8 blocks of chrominance pixels. In MPEG,
the two chrominance components are both sampled at half the horizontal and vertical
resolution of the luminance. As such, an MB completely describes a 16x16 color segment of a
picture. In an MB the 16x16 luminance pixels are further subdivided into four luminance
blocks of 8x8 pixels. The still or motion picture data in the form of MBs, represented by either
the actual pixel data (intra-mode) or only the error data (predictive mode), is input to a
transformation. This transformation is a two-dimensional Discrete Cosine Transform (DCT)
applied to each of the 8x8 blocks of the MBs. After applying the DCT, the resulting coefficients
undergo a quantization step. The DCT transform coefficients are uniformly quantized with a
matrix of quantization steps. MPEG specifies one of two reference matrices from which the
quantization steps may be derived. The choice of which matrix is used depends on the MB mode.
Although the reference matrices can be defined by the encoder at the beginning of a video
sequence, they remain fixed afterwards. MPEG allows dynamic changes to the matrix of
quantization steps, however, by allowing a scaling factor for the reference matrices. This scaling
factor can be changed for every MB. The adaptation of this scaling factor is performed on an MB
to MB basis and varies based on the complexity of the image and the available rate control
requirements.
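
The macroblock layout described above can be checked with a few lines of arithmetic. The following Python sketch is only an illustration; the names LUMA_SIZE, BLOCK_SIZE and macroblock_layout are hypothetical and do not come from the cited document.

```python
# Macroblock layout as described above: 16x16 luminance pixels plus
# two 8x8 chrominance blocks (chrominance at half resolution in both directions).
LUMA_SIZE = 16   # luminance pixels per side of a macroblock
BLOCK_SIZE = 8   # side of one 8x8 block


def macroblock_layout():
    """Return (luminance blocks, chrominance blocks, total samples) per macroblock."""
    luma_blocks = (LUMA_SIZE // BLOCK_SIZE) ** 2   # four 8x8 luminance blocks
    chroma_blocks = 2                              # one 8x8 block per chrominance component
    samples = (luma_blocks + chroma_blocks) * BLOCK_SIZE ** 2
    return luma_blocks, chroma_blocks, samples


print(macroblock_layout())  # (4, 2, 384)
```

Each macroblock thus carries six 8x8 blocks, and the quantization scaling factor mentioned above applies to all of them together.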

11.01.2000

An object of the invention is to provide further compression. To this end, the 
invention provides a method, a device, a transmitter, a transcoder, a receiver, a bit-stream and 
a storage medium as defined in the independent claims. Advantageous embodiments are 
defined in the dependent claims. 

In a first embodiment of the invention, blocks of transform coefficients are
provided and high-frequency transform coefficients are attenuated. The invention is based on
the insight that, especially for obtaining low bit-rates, attenuating high-frequency transform
coefficients is more advantageous than increasing a step-size for all transform coefficients.
Increasing the step-size causes accumulation of errors in predictively coded pictures, because
each predictively coded picture is predicted from a previous picture. In this way, errors will
accumulate over a GOP. A resulting coding artifact can be seen as "breathing" of the picture
quality over a time interval of a GOP, which is normally about half a second. The invention
provides further compression by attenuation of high-frequency coefficients. Lower frequencies
are attenuated less or not at all. Preferably, some of the coefficients of lower frequencies remain
unaffected. The accumulation of errors is thereby reduced.

For attenuating high-frequency coefficients, the invention provides a curve with
a higher quantization step for higher-frequency transform coefficients. Because no curve
information is put in the resulting bit-stream, the reconstruction is performed by using an
original quantization scaling factor only. The quantization according to the invention can be
seen as additional quantization, which is not taken into account in a reconstruction. Therefore,
a reconstructed coefficient will have a lower value than an original coefficient.

The bit rate can easily be regulated by shifting the curve to lower or higher 
frequencies and/or multiplying the curve. 

The aforementioned and other aspects of the invention will be apparent from
and elucidated with reference to the embodiments described hereinafter.



In the drawings: 

Fig. 1 shows a block of transform coefficients; 

Fig. 2 shows an illustration of perceived picture quality as function of time
when a higher step-size is used to encode an image signal;

Fig. 3 shows a preferred quantization curve according to the invention; 
Fig. 4 shows an embodiment of a transmitter according to the invention; and 
Fig. 5 shows an embodiment of a receiver according to the invention.

The drawings only show those elements that are necessary to understand the invention.



Fig. 1 shows a block of transform coefficients. These coefficients Ci are
obtained by transforming a block of data from a spatial domain to a frequency domain. In this
example, a picture is encoded per block of 8 by 8 pixels. The result of the transformation is a
block of 8 by 8 transform coefficients (so i = 0...63). From the left to the right in the block, the
horizontal spatial frequency increases. From the top to the bottom, the vertical spatial
frequency increases. The upper-left coefficient C0 represents a DC coefficient. Although
various transformations may be used, preferably a DCT is performed. In Fig. 1, a
zigzag scanning is used to obtain an order of transform coefficients Ci. Although another
scanning may be used, zigzag scanning is widely used, e.g. in MPEG.
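
The scanning order can be generated programmatically. The following Python sketch produces the conventional zigzag order for an n by n block; since Fig. 1 is not reproduced here, the assumption is that it shows this conventional pattern, starting at the DC coefficient and moving to the right first. The function name zigzag_order is hypothetical.

```python
def zigzag_order(n=8):
    """Return (row, column) positions of an n x n block in conventional zigzag order."""
    order = []
    for diagonal in range(2 * n - 1):
        # All positions on one anti-diagonal satisfy row + column == diagonal.
        rows = range(max(0, diagonal - n + 1), min(diagonal, n - 1) + 1)
        if diagonal % 2 == 0:
            rows = reversed(rows)   # even diagonals are walked from bottom-left to top-right
        order.extend((row, diagonal - row) for row in rows)
    return order


order = zigzag_order()
print(order[:6])   # [(0, 0), (0, 1), (1, 0), (2, 0), (1, 1), (0, 2)]
```

The index i of a coefficient Ci is then simply the position of its (row, column) pair in this list, so higher i roughly corresponds to higher spatial frequency.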

In MPEG, coefficients Ci are quantized by dividing them by a weight in a
predefined quantization matrix and by a quantization scaling factor. The quantization scaling 
factor is adapted per macro block. The weights in the quantization matrix are so arranged that 
the less important coefficients are coded with less accuracy. 
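
As a rough illustration of this quantization, the following Python sketch divides a coefficient by a matrix weight and a scaling factor and rounds the result. It is a simplification: the normalisation constants and the rounding rules of the actual MPEG standards are omitted, and the function names and numeric values are hypothetical.

```python
def quantize(coefficient, weight, scale):
    """Simplified quantization: divide by the matrix weight and the scaling factor."""
    return int(round(coefficient / (weight * scale)))


def reconstruct(level, weight, scale):
    """Simplified reconstruction as a decoder would perform it."""
    return level * weight * scale


# Hypothetical values: the same coefficient coded with two scaling factors.
fine = quantize(90, 8, 1)      # 90 / 8  = 11.25  -> 11
coarse = quantize(90, 8, 4)    # 90 / 32 = 2.8125 -> 3
print(reconstruct(fine, 8, 1), reconstruct(coarse, 8, 4))  # 88 96
```

With the larger scaling factor fewer levels need to be coded, at the cost of a larger reconstruction error (6 instead of 2 in this example).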

One way of reducing a bit rate would be to use a higher quantization scaling 
factor. Errors introduced by using a higher quantization scaling factor affect the picture 
quality. In a predictive coding method, like MPEG, errors in a given picture affect another 
picture when the given picture is used to predict the other picture. The degradation of intra- 
coded pictures is usually limited. 

An error made in a picture that is used as a predictor is forwarded to a predicted
picture. Accumulation of errors takes place, depending on the number of predictions that are
made. The accumulation of errors gives rise to annoying effects. One such effect is illustrated in
Fig. 2, in which the perceived picture quality Ppq is shown as a function of time t. The perceived
picture quality Ppq differs significantly between the beginning and the end of a GOP. This effect
may be called "breathing", i.e. the Ppq decreases during a GOP and increases suddenly with a new,
refreshing I picture.

Attenuating high-frequency coefficients Ci according to the invention reduces
this accumulation of errors. The coefficients are quantized with an additional quantization step
size Qadd. Because this additional quantization step-size is not put in the resulting bit-stream,
the reconstruction is performed with the quantization scaling factor and the quantization
matrix without taking into account the additional quantization. Therefore, a reconstructed
coefficient will have a lower value than an original coefficient Ci. As a result, the coefficient
Ci has been attenuated. To reflect the lower importance of higher frequencies, the DCT
coefficients are quantized with a variable quantization step depending on their position. Fig. 3
shows a curve QC showing the additional quantization step size Qadd as a function of a
position of the quantized transform coefficient Ci. The quantization step Qadd gradually
increases for higher-frequency transform coefficients Ci. The lowest coefficients Ci are not
affected, keeping their value from the incoming stream. This is done to protect the low
frequencies, to which the eye is very sensitive. In a practical embodiment, 8 coefficients always
remain unaffected. Keeping the lower frequencies unaffected is advantageous in preventing
the accumulation of errors.
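
The effect of the additional quantization can be sketched in Python as follows. The sketch assumes that the already quantized levels are divided by Qadd and truncated towards zero; the text does not specify the rounding, and the function name and all numeric values are hypothetical.

```python
def additional_quantize(levels, qadd_curve):
    """Divide each quantized level (in scan order) by its Qadd step.

    Truncation towards zero is an assumption; the text does not specify rounding.
    """
    return [int(level / qadd) for level, qadd in zip(levels, qadd_curve)]


# First 12 levels of a block in scan order (hypothetical values).
levels = [40, -12, 9, 6, -5, 3, 2, 1, 4, -3, 2, 1]
# Eight coefficients remain unaffected (Qadd = 1), then the step increases.
curve = [1] * 8 + [2, 2, 3, 4]

print(additional_quantize(levels, curve))
# [40, -12, 9, 6, -5, 3, 2, 1, 2, -1, 0, 0]
```

Because the decoder is not informed about Qadd, it multiplies the smaller levels by the unchanged weight and scaling factor, so the affected coefficients are reconstructed with a lower magnitude.
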

It would have been possible to remove every coefficient after a given position
in the DCT coefficient order. However, due to a particularity of the DCT, that would have
given rise to a deterioration of lower frequencies. In fact, the DCT is a kind of filter bank,
which extracts a value for a range of frequencies, i.e. a DCT coefficient. The filter bank is
however imperfect in that frequency ranges overlap each other. This means that a DCT
coefficient contains energy in a quite large band. Therefore, quantizing a coefficient has some
repercussions on lower frequencies. Thus, removing complete coefficients after a given
position would not only remove the energy contained in high frequencies but also some energy
contained in lower frequencies, because even a high-frequency coefficient contains energy in
lower frequencies. Attenuating is therefore better than removing transform coefficients Ci. A
smoothly increasing step-size is advantageous regarding the removing of energy from lower
frequencies. In a practical embodiment, a logarithmic function is used.
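
One possible shape of such a curve is sketched below in Python. The text only states that a logarithmic function is used and that 8 coefficients remain unaffected in a practical embodiment; the exact formula, the gain parameter and the name qadd_curve are assumptions made for illustration.

```python
import math


def qadd_curve(n_coeffs=64, protected=8, gain=1.0):
    """Return one Qadd step per scan position.

    The first `protected` positions keep a step of 1 (no additional quantization);
    beyond that the step grows logarithmically. The formula is an assumption.
    """
    curve = []
    for position in range(n_coeffs):
        offset = max(0, position - protected + 1)   # 0 for protected positions
        curve.append(1.0 + gain * math.log1p(offset))
    return curve


curve = qadd_curve()
print(curve[:8])            # eight steps of 1.0
print(round(curve[63], 2))  # step for the highest-frequency coefficient
```

A smoothly growing step of this kind attenuates the highest frequencies most, without the abrupt cut-off that removing coefficients would introduce.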

For bit rate control, the curve QC may be shifted. The curve QC is shifted to the
left to reduce the bit rate and shifted to the right to increase the bit rate. Preferably, some lower
frequency transform coefficients should not be additionally quantized. If lower frequencies
remain untouched, fewer errors are accumulated.

For further bit rate control, the curve QC may be multiplied by a certain
multiplier. In case the multiplier is larger than one, the coefficients are more coarsely
quantized and the bit rate is further reduced. In case the multiplier is smaller than one, the
coefficients are less coarsely quantized and the bit rate is increased. Shifting and multiplying
may be performed in combination or independently.
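
Shifting and multiplying can be sketched in Python as two small operations on a list of Qadd steps. Two choices here are assumptions and not taken from the text: positions with a step of 1 are left untouched by the multiplication, so that protected low frequencies stay protected, and no step is allowed to drop below 1. The function names and values are hypothetical.

```python
def shift_curve(curve, shift):
    """Shift the curve along the scan order: shift < 0 moves it left, shift > 0 right."""
    last = len(curve) - 1
    return [curve[min(max(i - shift, 0), last)] for i in range(len(curve))]


def multiply_curve(curve, multiplier):
    """Scale the steps above 1; steps of 1 stay untouched (an assumption)."""
    return [q if q <= 1.0 else max(1.0, q * multiplier) for q in curve]


base = [1.0, 1.0, 1.0, 2.0, 3.0, 4.0]   # hypothetical short curve
print(shift_curve(base, -1))     # [1.0, 1.0, 2.0, 3.0, 4.0, 4.0]  lower bit rate
print(shift_curve(base, 1))      # [1.0, 1.0, 1.0, 1.0, 2.0, 3.0]  higher bit rate
print(multiply_curve(base, 2))   # [1.0, 1.0, 1.0, 4.0, 6.0, 8.0]  lower bit rate
```

Shifting to the left also reduces the number of unaffected coefficients, which is why a limit on the shift may be desirable.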

Advantage may be taken of the prediction tree to avoid the propagation of
errors. Instead of quantizing pictures heavily at the beginning of a prediction sequence, it is
advantageous to quantize the last pictures in a prediction sequence more heavily. In this way,
predictions are made from rather good images. In a GOP with one I (intra-coded) picture
followed by P (predictively coded) pictures, a last P picture may be quantized more heavily.
Since about 25 pictures are displayed per second, one picture is displayed for only 0.04
second. During this very short lapse of time, the human eye does not observe a drop in quality.
The last P picture can therefore be squeezed, resulting in an extremely low number of bits used.
This P picture is of low quality, but this is no problem because the viewer does not notice
the drop in quality and the last P picture is not used to predict another picture. This strategy
can also be used for B (bi-directional predictively coded) pictures as they are also at an end of
a prediction sequence. Generally, the B pictures cannot be quantized as coarsely as the last P
picture, since there is more than one B picture in a GOP and repetitive drops in quality
within a GOP tend to lower the global quality of the image as perceived. This strategy does
not cause a breathing effect because errors that are forwarded to other pictures are smaller.
Although it is possible to quantize more than one last P picture more coarsely, this is less
advantageous.

As discussed above, quantizing pictures more coarsely can be achieved by
shifting or multiplying the quantization curve.
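
The picture-dependent strategy can be sketched in Python by assigning a curve multiplier to every picture of a GOP. The multiplier values are hypothetical. As a further assumption, the last P picture is only quantized heavily when the GOP contains no B pictures, because otherwise it might still serve as a reference.

```python
def picture_multipliers(gop, last_p=4.0, b=2.0, other=1.0):
    """Return one curve multiplier per picture of a GOP given as a string, e.g. 'IPPP'."""
    has_b = "B" in gop
    last_p_index = gop.rfind("P")
    multipliers = []
    for index, picture_type in enumerate(gop):
        if picture_type == "B":
            multipliers.append(b)        # B pictures: moderately coarser
        elif picture_type == "P" and index == last_p_index and not has_b:
            multipliers.append(last_p)   # last P picture: heavily coarser
        else:
            multipliers.append(other)    # pictures used for prediction: unchanged
    return multipliers


print(picture_multipliers("IPPP"))      # [1.0, 1.0, 1.0, 4.0]
print(picture_multipliers("IBBPBBP"))   # [1.0, 2.0, 2.0, 1.0, 2.0, 2.0, 1.0]
```

Pictures from which other pictures are predicted keep a multiplier of one, so that predictions are still made from good images.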

Fig. 4 shows an embodiment of a transmitter according to the invention. The
transmitter 1 comprises means for obtaining a signal S, e.g. a camera 10. The transmitter
further comprises an encoding device 2. Although not necessary to the invention, the encoding
device preferably encodes the image signal by using motion estimation and motion
compensation. The device 2 therefore comprises a motion compensator MC 11 and a motion
estimator ME 18. The motion compensated signal is transform coded in transform coder DCT 12.
This transform coder preferably applies a DCT transformation on the motion compensated signal.
After the transformation, the transformed signal is quantized in quantizer Q 13. This is a
standard quantization as is known from the prior art. To perform motion estimation, a
reconstructed picture is necessary. Therefore, the device 2 comprises an inverse quantizer IQ
14 and an inverse transform coder IDCT 15 that perform the inverse operations of Q 13 and DCT
12 respectively. In the inverse motion compensator IMC 16, which is coupled to the ME 18, the
reconstructed picture is obtained. The reconstructed picture is stored in the picture memory
MEM 17. The reconstructed picture is used for motion estimation in a next picture that has to
be encoded. The quantized motion compensated signal as obtained in Q 13 is furnished to an
additional quantizer Qadd 19 according to the invention. The Qadd 19 applies an additional
quantization to the signal with a curve as shown in Fig. 3. The signal is thereafter encoded in
the Variable Length Encoder VLE 20 to obtain a video elementary stream V. A bit rate R of
the video stream V is derived in VLE 20 and furnished to CPU 21. The CPU 21 is coupled to
Qadd 19 to control the bit rate by shifting and/or multiplying the curve. The CPU 21 also
controls other parts of the device 2. Qadd may be defined for blocks of 8x8 pixels. The
quantization scale factor used in Q 13 is defined per macroblock.
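
The control exercised by the CPU 21 on the Qadd 19 could, for example, take the form of a simple feedback rule. The following Python sketch is an assumption about how such a rule might look; the text does not describe the control algorithm. The function name, the step size and the shift limits are hypothetical.

```python
def update_shift(shift, measured_rate, target_rate, step=1, min_shift=-56, max_shift=0):
    """Adapt the curve shift from the measured bit rate R.

    A negative shift moves the curve to the left (lower bit rate). The limits
    are hypothetical; max_shift=0 means the curve never moves right of its base position.
    """
    if measured_rate > target_rate:
        shift -= step      # too many bits: shift the curve to the left
    elif measured_rate < target_rate:
        shift += step      # bits to spare: shift the curve to the right
    return max(min_shift, min(max_shift, shift))


print(update_shift(0, 6.0e6, 4.0e6))    # -1
print(update_shift(-3, 3.5e6, 4.0e6))   # -2
```

The same rule could drive the multiplier instead of, or in addition to, the shift.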

The Qadd is preferably located outside the encoding loop MC 11...ME 18. This
means that e.g. for motion compensation, a better quality is available than from the
additionally quantized signal. This is important, because a good quality of the reconstructed
picture means that better encoding can be obtained, in the case of motion compensation due to
a better motion estimation. The better the motion estimation, the better the
compression of the signal S. The video stream V is multiplexed with other streams, e.g. an
audio elementary stream A, in a multiplexer 22 to obtain a transport/program stream TS.
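
The placement of the Qadd outside the encoding loop can be sketched in Python for a single block. The quantization is the same simplified division used for illustration elsewhere (no MPEG normalisation constants), truncation in the Qadd step is an assumption, and all names and values are hypothetical.

```python
def encode_block(coefficients, weights, scale, qadd_curve):
    """Return (levels sent to the VLE, reconstruction used inside the encoding loop)."""
    # Standard quantization (Q 13), simplified.
    levels = [int(round(c / (w * scale))) for c, w in zip(coefficients, weights)]
    # Inverse quantization (IQ 14) works on the levels before Qadd,
    # so the prediction loop sees the better quality.
    reconstruction = [level * w * scale for level, w in zip(levels, weights)]
    # Only the copy that leaves the loop is additionally quantized (Qadd 19).
    to_vle = [int(level / qadd) for level, qadd in zip(levels, qadd_curve)]
    return to_vle, reconstruction


to_vle, reconstruction = encode_block([160, 90, 48, 30], [8, 8, 8, 8], 1, [1, 1, 2, 4])
print(to_vle)           # [20, 11, 3, 1]
print(reconstruction)   # [160, 88, 48, 32]
```

The reconstruction used for motion estimation is therefore unaffected by the attenuation applied to the transmitted levels.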

Coding bit-streams from an original bit-rate R1 to a new bit-rate R2 is called
bit-rate transcoding. Bit-rate transcoding is e.g. performed to unify incoming bit-streams that
vary in bit-rate. Further, the bit rates used for broadcasting are usually too high for storage on
digital storage media, given the condition that enough playing time should be available.
Therefore, bit rate reduction is often necessary. Furthermore, for implementation of a long-play
mode a low bit rate is required. The invention is preferably applied in a bit rate transcoder.
Such a bit rate transcoder may be included in a digital video recorder.

Fig. 5 shows an embodiment of a receiver (e.g. a digital video recorder, like D-VHS).
The receiver 3 comprises receiving means 4 and a bit rate transcoder 5. An incoming
transport/program stream TS1 is received in receiving means 4, e.g. an antenna system, and
thereafter demultiplexed in a demultiplexer 50 to obtain a video elementary stream V1 and an
audio elementary stream A1. The transport stream TS1 may be transmitted by an embodiment
as shown in Fig. 4. The video elementary stream V1 is passed to a Variable Length Decoder
(VLD) 51 to obtain blocks of transform coefficients Ci. The audio elementary stream A1 is
bypassed because transcoding an audio elementary stream will hardly affect the bit rate. After
the VLD 51, the decoded video stream is furnished to a quantizer Qadd 52 according to the
invention. In Qadd 52, the higher frequency coefficients are attenuated as explained above.
The additionally quantized video stream is furnished to a Variable Length Encoder (VLE) 53 to
obtain a new video elementary stream V2. As in Fig. 4, a bit rate R2 is derived in the VLE 53
and furnished to a CPU 54 to control the Qadd 52. The new video stream V2 is multiplexed
together with the parsed audio stream A1 in a multiplexer 55 to obtain a new transport stream
TS2. The new transport stream TS2 has a lower bit rate than the incoming transport stream
TS1. The new transport stream TS2 may be stored on a storage medium 6. The storage
medium 6 may be a DVD, a CD-ROM, a (D-VHS) tape, etc.
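
The core of the transcoder can be sketched in Python on blocks of already decoded levels. Variable length decoding and encoding are outside the scope of the sketch; the number of non-zero levels is used as a crude, assumed proxy for the number of bits the VLE 53 would produce. The function names and values are hypothetical.

```python
def transcode_blocks(blocks, qadd_curve):
    """Apply the additional quantization to every block of levels delivered by the VLD."""
    return [[int(level / qadd) for level, qadd in zip(block, qadd_curve)]
            for block in blocks]


def nonzero_count(blocks):
    """Crude proxy for the bit cost: count the levels that still have to be coded."""
    return sum(1 for block in blocks for level in block if level != 0)


blocks = [[20, 11, 3, 1], [15, -4, 2, -1]]   # hypothetical shortened blocks
curve = [1, 1, 2, 4]
out = transcode_blocks(blocks, curve)
print(out)                                   # [[20, 11, 1, 0], [15, -4, 1, 0]]
print(nonzero_count(blocks), nonzero_count(out))   # 8 6
```

No inverse transform, motion compensation or re-estimation is involved, only a division per coefficient between decoding and re-encoding.
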

Due to the extremely low complexity of the proposed algorithm, especially for
transcoding, it is well suited for applications that demand high throughput, like High
Definition (HD) bit-streams. HD bit-streams can be re-quantized to a lower bit-rate, while the
resolution remains the same. A preferred application of the invention reduces the bit-rate of an
HD-stream for storing the lower bit-rate HD-stream on a storage medium. Another preferred
application is a long play mode in Standard Definition (SD).

It should be noted that the above-mentioned embodiments illustrate rather than 
limit the invention, and that those skilled in the art will be able to design many alternative 
embodiments without departing from the scope of the appended claims. The word 'picture' 
also refers to frame, field, etc. In the claims, any reference signs placed between parentheses 
shall not be construed as limiting the claim. The word 'comprising' does not exclude the 
presence of other elements or steps than those listed in a claim. The invention can be 
implemented by means of hardware comprising several distinct elements, and by means of a 
suitably programmed computer. In a device claim enumerating several means, several of these 
means can be embodied by one and the same item of hardware. The mere fact that certain 
measures are recited in mutually different dependent claims does not indicate that a
combination of these measures cannot be used to advantage. 

In summary, a method and a device are provided, wherein a signal is encoded to
obtain a bit-stream. Blocks of quantized transform coefficients are provided. Transform
coefficients corresponding to higher frequencies are attenuated. For attenuating high-frequency
coefficients, the invention provides a curve QC with higher quantization step-size Qadd for
transform coefficients Ci corresponding to higher frequencies. Because this additional
quantization step size Qadd is not put in the resulting bit-stream, a reconstruction is performed
with an original quantization step-size, without taking the additional quantization into
account. Therefore, a reconstructed coefficient will have a lower value than an original
coefficient Ci. Bit rates can easily be regulated by shifting the curve QC to lower or higher
frequencies and/or multiplying the curve QC.

1. A method of encoding (2) a signal (S) to obtain a bit-stream (V,TS), the method
comprising the steps of:

providing (12,13) blocks of quantized transform coefficients (Ci); and
attenuating (19) high-frequency transform coefficients (Ci) of a given block.

2. A method as claimed in claim 1, the step of attenuating comprising:
quantizing (19) the transform coefficients (Ci) of the given block with a curve
(QC) having higher quantization steps (Qadd) for higher-frequency transform coefficients (Ci),
without putting information concerning said curve (QC) into the bit-stream (V,TS).

3. A method as claimed in claim 2,
further comprising the step of shifting (19,21) the curve (QC) to adjust a bit rate
(R) of the bit-stream (V,TS).

4. A method as claimed in claim 2,
further comprising the step of multiplying (19,21) the curve (QC) to adjust a bit
rate (R) of the bit-stream (V,TS).

5. A method as claimed in claim 2, wherein the signal (S) comprises intra-coded
and predictively coded pictures, the method comprising the step of:
quantizing (19,21) predictively coded pictures at an end of a prediction
sequence more coarsely than predictively coded pictures at a start of the prediction sequence.

6. A method as claimed in claim 1, the method further comprising the steps of:
decoding (14,15) the blocks of quantized transform coefficients (Ci) to obtain a
reconstructed picture rather than decoding (14,15) the attenuated (19) high-frequency
transform coefficients (Ci).

7. A device (2) for encoding a signal (S) to obtain a bit-stream (V,TS),
comprising:
means (12,13) for providing blocks of quantized transform coefficients (Ci); and
means (19) for attenuating high-frequency transform coefficients (Ci) of a given
block.

8. A transmitter (1) comprising:
means (10) for obtaining a signal (S); and
a device (2) for encoding the signal (S) as claimed in claim 7.

9. A method of transcoding (5) an encoded signal (TS1,V1), comprising the steps
of:
decoding (50,51) the encoded signal (TS1,V1) to obtain blocks of quantized
transform coefficients (Ci); and
attenuating (52) high-frequency transform coefficients (Ci) of a given block.

10. A bit rate transcoder (5) for transcoding an encoded signal (TS1,V1)
comprising:
means (50,51) for decoding the encoded signal (TS1,V1) to obtain blocks of
quantized transform coefficients (Ci); and
means for attenuating (52) high-frequency transform coefficients (Ci) of a given
block.

11. A receiver (3) comprising:
means (4,50) for obtaining an encoded signal (TS1,V1); and
a bit rate transcoder (5) for transcoding the encoded signal (TS1,V1) as claimed
in claim 10.

12. A bit-stream (V,TS,V2,TS2) comprising blocks of quantized transform
coefficients (Ci), wherein high-frequency transform coefficients (Ci) of a given block have
been attenuated.

13. A storage medium (6) on which a bit-stream (V,TS,V2,TS2) as claimed in
claim 12 is stored.

A method and a device are provided, in which a signal is encoded to obtain a
bit-stream. Blocks of quantized transform coefficients are provided. Transform coefficients
corresponding to higher frequencies are attenuated. For attenuating high-frequency
coefficients, the invention provides a curve (QC) with higher quantization step-size (Qadd) for
transform coefficients (Ci) corresponding to higher frequencies. Because this additional
quantization step size (Qadd) is not put in the resulting bit-stream, the reconstruction is
performed with an original quantization step-size, without taking the additional quantization into
account. Therefore, a reconstructed coefficient will have a lower value than an original
coefficient (Ci). Bit rates can easily be regulated by shifting the curve (QC) to lower or higher
frequencies and/or multiplying the curve (QC).

Fig. 3